Picture for Wanqing Xu

Wanqing Xu

RubricEval: A Rubric-Level Meta-Evaluation Benchmark for LLM Judges in Instruction Following

Add code
Mar 26, 2026
Viaarxiv icon

An Industrial-Scale Insurance LLM Achieving Verifiable Domain Mastery and Hallucination Control without Competence Trade-offs

Add code
Mar 15, 2026
Viaarxiv icon

Softmax Linear Attention: Reclaiming Global Competition

Add code
Feb 02, 2026
Viaarxiv icon

PEER: Expertizing Domain-Specific Tasks with a Multi-Agent Framework and Tuning Methods

Add code
Jul 10, 2024
Figure 1 for PEER: Expertizing Domain-Specific Tasks with a Multi-Agent Framework and Tuning Methods
Figure 2 for PEER: Expertizing Domain-Specific Tasks with a Multi-Agent Framework and Tuning Methods
Figure 3 for PEER: Expertizing Domain-Specific Tasks with a Multi-Agent Framework and Tuning Methods
Figure 4 for PEER: Expertizing Domain-Specific Tasks with a Multi-Agent Framework and Tuning Methods
Viaarxiv icon